Weight initialization

Dirichlet Energy Constrained Learning for Deep Graph Neural Networks

Neural Information Processing Systems

However, the performance of existing GNNs degrades significantly when many layers are stacked, because of the oversmoothing issue: node embeddings tend to converge to similar vectors as GNNs keep recursively aggregating the representations of their neighbors.
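
As a concrete illustration of the oversmoothing effect described above, here is a minimal sketch (numpy; a toy 4-node path graph with illustrative values, not from the paper) that tracks the Dirichlet energy trace(X^T L X) of node features under repeated mean aggregation with self-loops, as in a GCN with the nonlinearities stripped out. The energy decays toward zero as the embeddings collapse to near-identical vectors, which is exactly the quantity the paper's constrained learning keeps bounded.

import numpy as np

# Toy 4-node path graph (illustrative values).
A = np.array([[0., 1., 0., 0.],
              [1., 0., 1., 0.],
              [0., 1., 0., 1.],
              [0., 0., 1., 0.]])
L = np.diag(A.sum(axis=1)) - A                   # combinatorial graph Laplacian
A_hat = A + np.eye(4)                            # add self-loops, as in GCN
P = A_hat / A_hat.sum(axis=1, keepdims=True)     # row-normalized mean aggregation

def dirichlet_energy(X):
    # trace(X^T L X) equals the sum over edges (i, j) of ||x_i - x_j||^2
    return np.trace(X.T @ L @ X)

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                      # random initial node features
for layer in range(10):
    X = P @ X                                    # one aggregation layer, no nonlinearity
    print(layer, dirichlet_energy(X))            # decays toward 0: embeddings collapse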


S^3: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks

Neural Information Processing Systems

Shift neural networks reduce computational complexity by removing expensive multiplication operations and quantizing continuous weights into low-bit discrete values; they are fast and energy-efficient compared to conventional neural networks. However, existing shift networks are sensitive to weight initialization and yield degraded performance caused by the vanishing-gradient and weight-sign-freezing problems. To address these issues, we propose S^3 re-parameterization, a novel technique for training low-bit shift networks.
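
For intuition about what a shift network computes, here is a minimal sketch (numpy; the function name, exponent range, and test values are hypothetical) of quantizing continuous weights to signed powers of two so that multiplications can be realized as bit-shifts. This shows only the forward quantization; the paper's S^3 technique concerns how to train these discrete sign/sparsity/shift components without the gradient pathologies mentioned above, and is not reproduced here.

import numpy as np

def quantize_to_shift(w, p_max=7):
    # Round each weight to sign(w) * 2^(-p) with p in {0, ..., p_max};
    # weights below the smallest representable shift are pruned to zero.
    sign = np.sign(w)
    p = np.clip(np.round(-np.log2(np.abs(w) + 1e-12)), 0, p_max)
    q = sign * 2.0 ** (-p)
    q[np.abs(w) < 2.0 ** (-p_max - 1)] = 0.0
    return q

w = np.array([0.9, -0.3, 0.04, -0.002, 1.7])
print(quantize_to_shift(w))   # [1. -0.25  0.03125  0.  1.]: multiplies become shifts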



Sheaf Cohomology of Linear Predictive Coding Networks

Seely, Jeffrey

arXiv.org Artificial Intelligence

Predictive coding (PC) replaces global backpropagation with local optimization over weights and activations. We show that linear PC networks admit a natural formulation as cellular sheaves: the sheaf coboundary maps activations to edge-wise prediction errors, and PC inference is diffusion under the sheaf Laplacian. Sheaf cohomology then characterizes irreducible error patterns that inference cannot remove. We analyze recurrent topologies where feedback loops create internal contradictions, introducing prediction errors unrelated to supervision. Using a Hodge decomposition, we determine when these contradictions cause learning to stall. The sheaf formalism provides both diagnostic tools for identifying problematic network configurations and design principles for effective weight initialization for recurrent PC networks.
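
A minimal sketch of the correspondence described above (numpy; a hypothetical 3-node feedback loop with scalar stalks and one plausible convention for the edge restriction maps, not the paper's experiments): the coboundary delta sends node activations to edge-wise prediction errors, inference is gradient descent on (1/2)||delta x + b||^2, i.e. diffusion under the sheaf Laplacian L = delta^T delta, and the residual error is the component of the edge offsets b lying in ker(delta^T), the cohomological obstruction that inference cannot remove.

import numpy as np

# Hypothetical 3-node feedback loop; scalar stalks, edge weights w as restriction maps.
# Coboundary row for an edge u -> v with weight w:  error e = x_v - w * x_u.
edges = [(0, 1, 1.0), (1, 2, 1.0), (2, 0, 1.0)]
delta = np.zeros((3, 3))
for r, (u, v, w) in enumerate(edges):
    delta[r, u], delta[r, v] = -w, 1.0

L = delta.T @ delta                  # sheaf Laplacian
b = np.array([0.2, -0.1, 0.4])       # fixed edge-wise offsets (demo values)

x = np.zeros(3)
for _ in range(2000):                # PC inference as diffusion under L
    e = delta @ x + b                # edge-wise prediction errors
    x -= 0.1 * (delta.T @ e)         # gradient step on (1/2) * ||e||^2

# The residual is the projection of b onto ker(delta^T): the loop's internal
# contradiction, vanishing exactly when b sums to zero around the loop.
print(delta @ x + b)                 # approx [0.1667, 0.1667, 0.1667]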


Weights initialization of neural networks for function approximation

Hu, Xinwen, Huang, Yunqing, Yi, Nianyu, Yin, Peimeng

arXiv.org Artificial Intelligence

Neural network-based function approximation plays a pivotal role in the advancement of scientific computing and machine learning. Yet, training such models faces several challenges: (i) each target function often requires training a new model from scratch; (ii) performance is highly sensitive to architectural and hyperparameter choices; and (iii) models frequently generalize poorly beyond the training domain. To overcome these challenges, we propose a reusable initialization framework based on basis function pretraining. In this approach, basis neural networks are first trained to approximate families of polynomials on a reference domain. Their learned parameters are then used to initialize networks for more complex target functions. To enhance adaptability across arbitrary domains, we further introduce a domain mapping mechanism that transforms inputs into the reference domain, thereby preserving structural correspondence with the pretrained models. Extensive numerical experiments in one- and two-dimensional settings demonstrate substantial improvements in training efficiency, generalization, and model transferability, highlighting the promise of initialization-based strategies for scalable and modular neural function approximation. The full code is made publicly available on Gitee.
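
A minimal sketch of the initialization strategy described above (PyTorch; the architecture, the single monomial basis function, and the target are hypothetical stand-ins for the paper's families of pretrained polynomial basis networks): pretrain a small network on a polynomial over the reference domain [-1, 1], reuse its weights to initialize a network for a new target, and apply an affine domain map so inputs from an arbitrary interval are transformed into the reference domain, preserving structural correspondence with the pretrained model.

import torch
import torch.nn as nn

def make_net():
    # small MLP reused for both basis pretraining and the final target
    return nn.Sequential(nn.Linear(1, 64), nn.Tanh(),
                         nn.Linear(64, 64), nn.Tanh(),
                         nn.Linear(64, 1))

def fit(model, f, lo, hi, steps, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(steps):
        x = torch.rand(256, 1) * (hi - lo) + lo          # sample the training domain
        loss = ((model(x) - f(x)) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()
    return model

class DomainMapped(nn.Module):
    # affine map [lo, hi] -> [-1, 1], so the pretrained net sees its reference domain
    def __init__(self, net, lo, hi):
        super().__init__()
        self.net, self.lo, self.hi = net, lo, hi
    def forward(self, x):
        return self.net(2.0 * (x - self.lo) / (self.hi - self.lo) - 1.0)

# 1) Pretrain a basis network on a polynomial over the reference domain [-1, 1].
basis_net = fit(make_net(), lambda x: x ** 3, -1.0, 1.0, steps=2000)

# 2) Reuse its weights as the initialization for a new target on [0, 4],
#    wrapped in the domain map to preserve correspondence with pretraining.
net = make_net()
net.load_state_dict(basis_net.state_dict())
model = fit(DomainMapped(net, 0.0, 4.0),
            lambda x: torch.sin(x) + 0.1 * x ** 2, 0.0, 4.0, steps=500)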